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Abstract — In the present paper, a modification of tlie Index-less 
Indexed Flasli Codes (ILIFC) for flash memory storage system 
is presented. Although the ILIFC proposed by Mahdavifar et al. 
has excellent worst case performance, the ILIFC can be further 
improved in terms of the average case performance. The proposed 
scheme, referred to as the layered ILIFC, is based on the ILIFC. 
However, the primary focus of the present study is the average 
case performance. The main feature of the proposed scheme is 
the use of the layer-based index coding to represent indices of 
information bits. The layer index coding promotes the uniform 
use of cell levels, which leads to better average case performance. 
Based on experiments, the proposed scheme achieves a larger 
average number of rewritings than the original ILIFC without 
loss of worst case performance. 

I. Introduction 

The emergence of studies by Jiang, Bohossian, and Bruck 
lIUlSl on flash codes gave rise to a new field of coding. They 
modeled a flash memory as a Write Asymmetric Memory 
(WAM) and proposed flash codes for improving the worst 
number of rewritings. These studies attracted the interest of a 
number of coding theorists and inspired subsequent research 
on flash codes. In 2008, Yaakobi, Vardy, Siegel, and Wolf 
presented flash codes based on the enhanced multidimensional 
construction and they proposed a novel criterion, called 
write deficiency, for flash codes. 

Most flash codes proposed thus far are designed to optimize 
the worst case performance, such as the write deficiency 
lElEllSlQ. On the other hand, recendy, a flash code to 
improve the average number of rewritable bits was reported 
|l6l m . The average number of rewritable bits is the 
average number of allowable bit flips between consecutive 
erase operations. The flash codes based on the Gray codes 
proposed by Finucane, Liu, and Mitzenmacher |5| exhibit 
excellent average performance. They also presented a method 
for analyzing the average number of rewritable bits, which 
is based on a Markov chain model constructed from the state 
diagram of the code and a probabilistic model for the rewriting 
process [5 |. In the lifetime of a flash memory, it is expected 
that poor average performance will result in early collapse 
of the cells. In this respect, flash codes should be designed 
to improve not only the worst case performance but also the 
average case performance. 

The Index-less Indexed Flash Codes (ILIFC) proposed by 
Mahdavifar, Siegel, Vardy, Wolf, and Yaakobi [7| achieve 
excellent worst case performance. The ILIFC has been proven 
to provide almost optimal write deficiency. The prominent 



feature of the ILIFC is that a sub-block of cells represents both 
the value of an information bit and the index of the bit. This 
feature leads to simple encoding and decoding procedures. 

In the present paper, we present an improvement of the 
ILIFC in terms of average case performance. Since the ILIFC 
is designed based on the worst case performance, there is 
room for improving the average case performance without 
incorporating drastic changes in the original algorithm. 

The main concept of the scheme proposed herein is the 
use of layer-based index coding. In the original ILIFC, if a 
sub-block represents an index, the index cannot be changed 
to another index until the next erase operation. If bit flips 
in the information vector occur according to a nonuniform 
distribution, the difference between cell levels tends to be 
large. It is not trivial to balance the cell levels if most of 
the sub-blocks have their own indices. The layer-based index 
coding enables the encoder to change the index of a sub- 
block, and this feature provides flexibility for adjusting the 
differences between cell levels. 

In the present paper, the layered ILIFC, which uses the 
layer-based index coding, will be proposed, and its average 
case performance will be evaluated through computer simula- 
tions and a Markov chain method designed for analyzing the 
average case performance of variants of the ILIFC. 

II. Preliminaries 

In this section, some basic assumptions on flash memories 
and a rewriting model are introduced according to 

A. Flash memory 

A flash memory contains several cells that can store elec- 
trons. The charge in a cell can be increased through the 
injection of electrons. The process of increasing the cell level 
is referred to as cell programming. A cell can represent q-ary 
values. For example, flash memories with q = 2 and 4 cell 
levels have been realized in commercial products. The erase 
operation is the operation to remove the charge from cells in 
an erase block, which is a set of cells. Note that the erase 
operation can only be applied to all of the cells in an erase 
block. In other words, a cell cannot be erased individually. 

There is a limitation in the number of rewritings for a 
cell. If the number of rewritings exceeds this limit, the cell 
tends to operate incorrectly. Therefore, an appropriate coding 
for reducing the number of rewritings is required in order to 
lengthen the lifetime of a flash memory fSlfTl. 



B. Rewriting model 

In this subsection, a simple rewriting model used throughout 
the present paper is introduced |2|[7|. The information vector 
(wi, U2, • • • , Wfc) is a binary fc-tuple, which is to be stored in 
an erase block. The initial state of the information vector is 
assumed to be {vi,V2, ■ ■ ■ ,Vk) = (0, 0,---,0). The binary 
information in the information vector is written into cells 
through several rewriting processes. It is assumed that a 
rewriting process occurs when any one bit in the information 
vector flips. 

The cell state vector for an erase block is denoted by 
(ci,C2, • • • ,c„), where Cj £ {0,---,g - G {!,..., n}). 
The symbol cj represents the state of the jth cell. The contents 
of the information vector are stored in the cell state vector. 
The initial state of the cell state vector is also assumed to be 
(C1,C2, ■ • • ,c„) = (0, 0, • • • , 0). 

Only two operations to change the state of a cell state 
vector are allowed, namely, cell programming and the erase 
operation. Cell programming increases a cell level by one until 
the level reaches q—1. Suppose that there are two state vectors 
c = (ci, C2, • ■ • , c„) and c' = (c^, Cj, ■ • ■ , c^). If Cj > c'j holds 
for any j e {1, 2, • • • , n}, then c is said to be higher than c'. 
These assumptions mean that the allowable state transitions of 
an erase block are a transition from a lower state to a higher 
state. An erase operation forces the cell state vector to be a 
zero vector 

A goal of the flash codes is to reduce the number of erase 
operations to be as small as possible, under the assumption 
that the number of bit flips of information bits is fixed. 

III. Index-less indexed flash codes 

In this section, a brief introduction of the ILIFC as reported 
by f?! is presented. Although an encoding and decoding 
process of the ILIFC consists of multiple stages, for simplicity, 
we herein focus only on the first stage. 

Assume that an erase block contains n-cells and that these 
cells are divided into sub-blocks of the same size and any 
remaining cells. The size of a sub-block is assumed to be k 
satisfying n > k^, and k mod 2 = 0. The number of sub- 
blocks is given by m = [n/fcj, and the number of remaining 
cells is n — km. The length of the information vector is k. The 
key feature of the ILIFC is the representation of individual bits 
in the information vector Each sub-block in an erase block 
represents both an index and a value of an information bit. 

Let us denote the state of an erase block as (a;i | a;2 | • • • | 
Xm) where Xi G {0, . . . ,q — £ {!,..., m}) is the ith 

sub-block. For a sub-block x = (ci, C2, • • • , Cfc), the weight 
and the parity of x are defined as wt{x) — ci + C2 + ■ ■ ■ + Ck, 
and parity(a;) ~ wt{x) mod 2, respectively. 

The following terminology is used throughout the present 
paper. A state of a sub-block is said to be 

1) full if all cells in the sub-block have level q — 1; 

2) empty if all cells in the sub-block have level 0; 

3) active if the sub-block is neither full nor empty; 

4) live if the sub-block is not live. 
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Fig. I. Index coding for the ILIFC (fc = 4, g = 3). Tlie variable i expresses 
the value of the index. Whenever infonnation bit Vi turns, a state transition 
(an arrow) from a cell state vector to a higher cell state vector occurs. 



A. Index coding for the ILIFC 

A sub-block represents a single bit in the information vector 
The details are as follows. Let / : {0, . . . ,q — 1}'^ — 5- 
{0, 1, 2, . . . , fc} be an index map that converts a state of a 
sub-block into an index of the information vector In |j21, the 
index map is given by 

I(x) = / ^'^SmaXig{i^...^fe}[Q - C(,_2 mod fe)+i], x^O 
1 0, otherwise, 

(1) 

where x = (ci, C2, • • • , Cfc). Note that I{x) = implies that 
the sub-block x has no index. The bit value map for a sub- 
block: V : {0, . . . , q—1}'^ {0, 1} is given simply as V{x) = 
parity (a;). 

Based on the definition presented above, a sub-block Xi can 
represent both an index I{xi) and the bit value V{xi). At any 
moment during an encoding process (to be described later), the 
ILIFC encoder maintains the consistency as follows: = 
V{xi) if I{xi) 7^ for i = 1, 2, . . . , fc, where (ui, . . . , Vk) is 
the information vector. A coding that achieves this consistency 
is referred to as an index coding. 

In the ILIFC, the index coding for a sub-block is carefully 
designed in order to avoid an early erase operation. Figure 
[T] illustrates the process for the case in which fc = 4 and 
g = 2. In Fig. [T] the variable i corresponds to the index. 
The position at which the level of the cells begins to rise 
corresponds to the index i and is consistent with the definition 
of /(•). Furthermore, a bit flip on Vi induces an increment of 
a cell value that corresponds to the arrow shown in Fig. [T] 
Note also that the states of two active sub-blocks cannot have 
the same state if the indices of two sub-blocks are different. 

B. Pseudo-code for the ILIFC encoder and decoder 

The decoding and encoding algorithms for the ILIFC are 
shown in the form of a pseudo-code according to liTJ. 
Decoding map 

{vi,v2, ■ • ■ jWfc) = (0, 0, ■ • ■ ,0); 



for {j = l;j < m;j = j + 1} 
if (active(a;j)) 

{ i = read_index(a;j);ui — parity(a;j);} 



Encoding map 

y = {yi \ y2 \ ■ ■ ■ \ ym) = {xi \ X2 

for (j = l;j < m;j = j + 1) { 
if (active(a;j) A (read_index(a;j) = 
{write(yj);return(y);} 

} 



for ij = l;j < m;j = j + I) 
if (empty(a;j)) 

{write_new(i, yj);return(y);} 
return E; 

The symbol E represents the erase operation. The details of 
the functions, such as active( ), can be found in Q- 

IV. Layered ILIFC for improving average case 

PERFORMANCE 

In this section, we propose a modified ILIFC to improve the 
average case performance. A significant difference between the 
original ILIFC and the modified ILIFC is the index coding for 
a sub-block. The modified ILIFC introduces the concept of the 
layer in its index coding. The layer in a sub-block enables us 
to reset a sub-block, which means that an index embedded in 
a sub-block can be changed to another index. This flexibility 
promotes the uniform use of cell levels in an erase block and 
leads to an improvement in the average case performance. 

A. Outline of Layered ILIFC 

In the following, for simplicity, we assume that an erase 
block contains exactly n — cells. 

In the encoding process of the original ILIFC, the index 
of a sub-block is fixed until the next erase operation occurs. 
Thus, if several bits in the information vector are frequently 
rewritten, the sub-blocks corresponding to these bits tend to 
become full in early phase. In other words, local rewritings 
in the information vector induce an imbalance of cell levels 
between the sub-blocks. This imbalance tends to cause an early 
erase operation. In order to overcome this problem, maintain- 
ing the balance of cell levels for any rewriting sequence is 
crucially important. 

In the modified ILIFC, the cells in a sub-block are pro- 
grammed from the bottom layer to the top layer If a layer 
is filled (or programmed), we may alter the index of the sub- 
block. In the following, we refer to the proposed modified 
scheme as the layered-ILIFC (L-ILIFC). 

B. Index coding for Layered ILIFC 

In the following, we assume that k is an even number 
Suppose that a cell Ci has the value I G {0, ... ,(7 — 1}. In 
such a case, the cell c; is said to be in layer I. In other words, 
cell levels are divided into g-layers. 

Next, we introduce some additional considerations regard- 
ing the state of a sub-block. If all of the cells in a sub-block 
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Fig. 2. Index coding for tlie layered ILIFC (k = 4, q = 3); The variable i 
expresses the value of the index. Whenever information bit Vi turns, a state 
transition (an arrow) from a cell state vector to a higher cell state vector 
occurs. 



belong to the same layer I e {0, 1, . . . , q — 2}, then the sub- 
block is said to be clear. 

We first explain the concept of index coding using the 
following example. The details of the index coding will be 
presented in Subsection IIV-CI Figure |2] presents the details of 
the index coding for k = 4 and q — 2. Initially, the cell is 
empty (empty boxes at the top in Fig. |2]l. A single bit change 
in the information vector corresponds to the arrows emerging 
from the empty state. An index i (from 1 to 4) is written in 
a sub-block by incrementing the value of the ith cell, and the 
parity of the sub-block becomes one. 

It is readily observed that two more changes in the infor- 
mation vector transforms the state of the sub-block into the 
clear state. Namely, all of the cells belong to the same layer. 
A clear sub-block has no index. Note that we can write any 
index into a clear sub-block, as shown in Fig. |2] 

The index map for this index coding is, thus, given by 



I'ix) = 



0,Vi,j e {!,..., fc},Ci = Cfc,(i.e., x is clear) 



argmaxjg{i,..._fc}[ci - C(,j_2 mod fc)+i], otherwise. 

(2) 

Due to the assumption that k is even, any clear sub-block 
has a value of zero. This is because the sum ci + . . . + Ck 
becomes an integer multiple of k. Therefore, there is no 
problemfor a sub-block to forget both its index and the value 
when it becomes a clear sub-block. 

C. Details of the layered ILIFC 

The pseudo-code of the encoding map of the layered ILIFC 
is shown below. 
Encoding map 

y = (2/1 I y2 I • • • I ym) = | a;2 | • • • | Xm); 
for (j = 1; j < to; j j + 1) { 
if ((^ clear(a;j)) A (read_index2(a;j) i)) 
{ write2(y0; return(y);} 

} 



for il = 0; I < q- 1; I = 1 + 1) { 
for ij ^1; j < m; j = j + 1) { 
if (clear(a;j) A (read_layer(a;j)== /)) 
{write_new2(«, yj); return(y);} 

} 
} 

return E; 

The clear sub-blocks in an erase block can be considered 
to be a resource for achieving flexibility. Assume that the 
encoder must write the index value z in a sub-block. If there 
is no active sub-block with the index value i, then the encoder 
searches for a clear sub-block in the lowest layer Note that the 
encoder of the original ILIFC can choose a sub-block to write 
the index value i only from among the empty sub-blocks. In 
other words, the existence of clear sub-blocks and this search 
method reduces the nonuniform use of the cell levels. 

The details of the functions used in the pseudo-code are 
given as follows. The function clear(a;) returns true if x 
is clear; otherwise the function returns false. The function 
read_layer(a;) returns the layer index of a; if a; is clear; 
otherwise the function returns —1. 

The function read_index2(a;) is defined as 

read_index2(a;) = I'{x). 

The function write_new2(i, x) is a function to write the 
index value i into the clear sub-block x. This function simply 
increments the value of Ci by one. The function write2(cc) 
changes the value of the active sub-block x by incrementing 
C(»+wt(x)) mod fe where x = (ci,...,Cfe). This rule of the 
change in the value of a sub-block corresponds to the arrows 
in Fig. 12] 

D. Worst case performance 

The worst number of possible rewritings between a consec- 
utive erase operation for the proposed scheme is same as that 
for the original ILIFC. This is because both schemes share 
the same worst case events. A worst case event occurs when 
the cell vector contains one full sub-block and fc — 1 active 
sub-blocks of cell level 1. 

V. Markov chain method 

In this section, we briefly review the Markov chain method 
for average case analysis ||5] and then apply the Markov chain 
method to the analysis on the ILIFC and the Layered ILIFC. 

The concept of the Markov chain method as follows. We 
first construct a state transition diagram for a flash code to be 
analyzed. A state of the diagram is an allowable state of the 
cell state vector Each edge (i.e., state transition) has its own 
probability, which is determined based on the probabilistic 
model of rewriting sequences for the information vector The 
state transition diagram naturally defines a Markov chain 
corresponding the pair of the flash code and the probabilistic 
model. 

The steady state probability of the Markov chain can be 
obtained by solving a simultaneous linear equation system 
defined by the transition probability matrix. The average 
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Fig. 3. An example of the Markov chain for ILIFC (n = 4, fe = 2, m = 
2,<? = 2) 

TABLE I 

Average probability of occurrence of erase operations of the 
LAYERED ILIFC (n = 4, fc = 2, q = 4). 





Pi = 1/2 


Pi = 1/5 


Computer simulation 


0.091006 


0.095430 


Markov chain 


0.091006 


0.095431 



The probability Pi is the probability such that the first bit of the infonnation bit is 
flipped. 



number of possible rewritings is directly derived from the 
steady state probability. 

Figure [3] illustrates the state transition diagram of the ILIFC 
with the parameters n — A,k = 2, and q = 2. The binary 
4-tuples in the boxes are the states of the cell state vector 
in — 4), and the binary 2-tuples in parentheses represent the 
states of the information vector The arcs connecting the boxes 
denote possible state transitions. The dotted arcs correspond 
to the state transitions that induce an erase operation. 

The average probability of the occurrence of an erase 
operation is given by the sum of the probabilities of the state 
transitions corresponding to the dotted arcs. This probability 
can be calculated from the steady state probability. The aver- 
age number of possible rewritings is given as the inverse of 
this probability [5 1. 

If the state space of the ILIFC or the Layered ILIFC is not 
so large, then this Markov chain method is useful to obtain an 
accurate estimate. Table H] compares the average probability 
of an occurrence of an erase operation obtained through 
computer simulation and the Markov chain method. We have 
confirmed that the values obtained by the Markov method and 
the computer simulations are in reasonable agreement. 

VI. Results of computer experiments 

We performed computer experiments to compare the Lay- 
ered ILIFC and the original ILIFC in terms of the number of 
possible rewritings between two consecutive erase operations. 

The assumptions made in the simulations are as follows. In 
unit time, only one bit in the information vector flips, which 
induces a state change in the cell state vector The flipped bit is 
chosen according to the uniform distribution from 1 to fc. The 
same pseudo-random sequences are used for both schemes 
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in order to realize a fair comparison. In the simulation, the 
information vector and the cell state vector are initialized after 
a block erase operation. 

Figure |4] presents histograms for the number of rewritings 
between consecutive erase operations. The horizontal axis 
represents the number of rewritings, and the vertical axis 
denotes the frequency of the number of rewritings observed 
in the experiment. In an experiment, rewriting operations are 
iterated until the number of erase operations becomes 10"*- 
times. The parameter settings are as follows. The number of 
cells in an erase block is n = 16, and each cell can take a 
value from to 7 (i.e., q = 8). The length of the information 
vector is A; = 4, and thus to = 4. 

It is readily observed that the histogram curve of the layered 
ILIFC indicates a desirable behavior of the layered ILIFC. If 
the number of rewritings is not so large (i.e., smaller than 94), 
then the layered ILIFC provides a much smaller frequency 
than the ILIFC. The figure also includes the average number 
of rewritings. The layered ILIFC yields 100.89 rewritings, 
and the original ILIFC yields 93.65 rewritings. On average, 
approximately seven more rewritings are possible with the 
layered ILIFC. 

The Markov chain method presented in Section FV] provides 
accurate values of the expected number of rewritings for the 
layered ILFC and the ILIFC. 

Figure |5] shows the trade-off curves between the code rate 
versus the average number of rewritings. The code rate is 
defined as i? = k/n. The parameters k — 2 and n — A 
are assumed. The layered ILIFC provides a better trade-off 
compared with the original ILIFC. This indicates that the 
proposed scheme has better average case performance than 
that of the original scheme. 

VII. Conclusive summary 

The original ILIFC is an excellent flash code having near 
optimal write deficiency. In the present paper, we revealed 



that a simple modification of the index coding promotes the 
uniform use of the cell level. In addition, we demonstrated that 
the average performance of the ILIFC can be further unproved 
without loss of worst case performance. The Markov chain 
method for the ILIFC may be a useful tool for optimizing the 
detail of the algorithm in terms of average performance. 
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